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ABSTRACT 

We discuss the properties of Ha emission stars across the sample of 22035 spectra from the Gaia-ESO Survey internal 
data release, observed with the GIRAFFE instrument and largely belonging to stars in young open clusters. Automated 
fits using two independent Gaussian profiles and a third component that accounts for the nebular emission allow us 
to discern distinct morphological types of Ha line profiles with the introduction of a simplified classification scheme. 
All in all we find 3765 stars with intrinsic emission and sort their spectra into eight distinct morphological categories: 
single-component emission, emission blend, sharp emission peaks, double emission, P-Cygni, inverted P-Cygni, self¬ 
absorption, and emission in absorption. We have more than one observation for 1430 stars in our sample, thus allowing 
a quantitative discussion of the degree of variability of Ha emission profiles, which is expected for young, active objects. 
We present a catalogue of stars with properties of their Ha emission line profiles, morphological classification, analysis 
of variability with time and the supplementary information from the SIMBAD, VizieR, and ADS databases. The records 
in SIMBAD indicate the presence of Ha emission for roughly 25% of all stars in our catalogue, while at least 305 of them 
have already been more thoroughly investigated according to the references in ADS. The most frequently identified 
morphological categories in our sample of spectra are emission blend (23%), emission in absorption (22%), and self¬ 
absorption (16%). Objects with repeated observations demonstrate that our classification into discrete categories is 
generally stable through time, but categories P-Cygni and self-absorption seem less stable, which is the consequence 
of discrete classification rules, as well as of the fundamental change in profile shape. Such records of emission stars 
can be valuable for automatic pipelines in large surveys, where it may prove very useful for pinpointing outliers when 
calculating general stellar properties and elemental abundances. They can be used in studies of star formation processes, 
interacting binaries, and other fields of stellar physics. 

Key words. Stars: activity - Stars: emission-line. Be - Stars: peculiar - Galaxy: open clusters and associations: general 
- Line: profiles - Catalogs 


* Based on data products from observations made with ESO 
Telescopes at the La Silla Paranal Observatory under pro¬ 
gramme ID 188.B-3002. These data products have been pro¬ 
cessed by the Cambridge Astronomy Survey Unit (CASU) at 


the Institute of Astronomy, University of Cambridge, and by 
the FLAMES/UVES reduction team at INAF/Osservatorio As¬ 
trofisico di Arcetri. These data have been obtained from the 
Gaia-ESO Survey Data Archive, prepared and hosted by the 
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1. INTRODUCTION 

The Gaia-ESO Survey (GES) is a large spectroscopic sur¬ 
vey that targets more than 10® stars in a magnitude range 
8 E < 19, systematically covering all major compo- 
nents of the Milky Way, from halo to star forming regions 


(Gilmore et al. 2012 Randich et al. 2013). With uniform 
mode of observation, well-defined samples, based primarily 
on ESO-VISTA photometry for the field stars, and by a 
variety of photometric surveys of open clusters, GES will 
quantify the kinematic-multielement abundance distribu¬ 
tion functions of stellar components in the bulge, the thick 
and thin discs, and the halo, as well as a significant sample 
of ~ 100 open clusters, covering a wide range of ages, stel¬ 
lar masses, metallicities, and distances from the Galactic 
centre. 

Within the GES, the analysis of different types of stars 
is divided into different working groups (WGs). WGIO is 


as a proxy for stellar ages. Another possibility is that they 
belong to the field and can be so far unrecognised interact¬ 
ing binaries, like symbiotic stars and cataclysmic variables. 
In this case GES provides a unique census of interacting 
binaries at all Galactic latitudes, thereby helping to sample 
the faint luminosity tail of their distribution that is under¬ 
represented in existing catalogues. 

The Ha emission profiles display a variety of shapes 
depending on the physical mechanism responsible for the 
emission and on the geometry of the source. The simplest 
profiles are symmetric with a Gaussian-like appearance and 
are presumably originated in one distinct source, but other 
more complex lin e profiles are f r equen tly o bserved. We re¬ 
fer the readers to Bonito et al. (2013) andjReipurth et al. 
(1996) for a discussion of the expected Ha profile fea- 


(Recio-Blanco et al. 

2014 and WGll of FGK-type UVES 

spectra (|Smiljanic e 

al. |2014 

1. WG12 focuses on sources 

in the held of young 
WG13 on OBA-typ 

clusters 

Lanzafame et al. 

20151 and 

e stars (I 

Homme et al. 

m 

Ia). WG14 


tures and their physical origin. Mechanisms that produce 
such multicomponent profiles are usually attributed to very 
young stars, cataclysmic variables, symbiotic stars, stars 
with massive outfl ows or inflows, and to many other types 
of active objects ([Hamilton et ah 20^, Kurosawa fc Ro- 


standard analyses produce uncertain or no results because 
of unusual spectroscopic features that are not properly ac¬ 
counted for by automatic pipelines. 

The Ha emission, indicative of accretion and outflow ac¬ 
tivity in young stars, is one of these outstanding features. 
Indeed, when not properly recognised, it can endanger the 
parameter or abundance determination analysis, for exam¬ 
ple because it affects the local continuum placement or the 
radial velocity determination. It was therefore decided, as 
part of the WG14 attributions, to systematically detect and 
characterise Ha emission. 

This paper presents a catalogue of 4459 Ha emission 
stars and their 8846 spectra across the sample of 22 035 
spectra of 12 392 stars observed with GIRAFFE instrument 
from the GES internal data release (first 22 months of obser¬ 
vations). Special care is taken to distinguish intrinsic stel¬ 
lar emission from the nebular one. Altogether, we find 7698 
spectra with intrinsic emission that belong to 3765 stars. 
Such stars are often found in stellar nurseries and nebular 
environments of young open clusters and can therefore serve 


manova ||2013[ [Biazzo et al.||2006[ |Lanzaiame et al.||2000p~ 

Many attempts have already been made to classify ac¬ 
tive objects of a certain type with the use of emission lines. 
An atlas of symbiotic stars was compiled in the context of 
a Ha morph ological classification usin g Gaussian fitting of 
l ine pr ofiles (Van Winckel et al.| 1993). Rauscher & Marcy 
(2006) demonstrate that a double-Gaussian model repro¬ 
duces the profiles of chromospheric emission lines using a 
sample of 147 K- and M-type stars. A temporal analysis 
of Rigel (/3 Ori) spectra showed an example of morpholog¬ 
ical classification of distinct Ha prof iles, which the author s 
classified visually into seven classes ( Morrison et al.|[2008 ). 
Another approach at an alysis of Ha profiles was shown by 
Kurosawa et al. (2006) where the authors used radiative 
transfer model to describe the various observed profiles of 
classical T Tauri stars and compared it with the morpho¬ 
logical classification scheme proposed by |Reipurth et al.| 

(ll^. 


Wide Field Astronomy Unit, Institute for Astronomy, University 
of Edinburgh, which is funded by the UK Science and Technol¬ 
ogy Facilities Council. This work was partly supported by the 
European Union FP7 programme through ERC grant number 
320360 and by the Leverhulme Trust through grant RPG-2012- 
541. We acknowledge the support from INAF and Ministero dell’ 
Istruzione, dell’ Universita’ e della Ricerca (MIUR) in the form 
of the grant "Premiale VLT 2012". The results presented here 
benefit from discussions held during the Gaia-ESO workshops 
and conferences supported by the ESF (European Science Foun¬ 
dation) through the GREAT Research Network Programme. A. 
Bayo Acknowledges financial support from the Proyecto Fonde- 
cyt Iniciacion 11140572. F. Jimenez-Esteban acknowledges fi¬ 
nancial support from the ARGHES project (7th Framework of 
the European Union, n 313146). This research has made use of 
the SIMBAD database and VizieR catalogue access tool, CDS, 
Strasbourg, France. The original description of the VizieR ser¬ 
vice was published in A&AS 143, 23. Tables and are 

only available in electronic form at the CDS via anonymous 
ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsweb.u- 
strasbg.fr/cgi-bin/qcat?J / A-fA/ 


The above studies have mostly dealt with specific types 
of stars on smaller samples of several dozens of objects. In 
this paper we do not limit ourselves to any distinct type 
of stars, as this would require customised treatment when 
identifying and analysing the very diverse physical condi¬ 
tions purely from observed line profiles. Instead, we try to 
be impartial towards distinct physical characteristics and 
treat all available spectra in a uniform manner. In this 
way, we are able to review and morphologically classify sev¬ 
eral thousand objects without particular assumptions and 
model dependent parameters. 

We approach the problem first with the detection of Ha 
emission in spectra and secondly, we introduce a simple 
morphological scheme for their classification. This classi¬ 
fication scheme, together with supplementary information 
about detected objects, like their classification from the lit¬ 
erature where available, might prove useful to those who 
deal with the aforementioned types of active objects and 
could serve as a uniform and unbiased treatment of largely 
diverse emission profiles. 

In Sect. 2 we present the data used in this study and 
the methods of reduction. The results of classification and 
temporal variability of objects are presented in Sect. 3. Dis¬ 
cussion and conclusions follow in Sect. 4. 
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Fig. 1: Whole sky representation of the 20 fields marked by circles. At the top right of each circle, we indicate the number 
of objects within a given field, where one-object fields represent benchmark stars. 


2. DATA AND REDUCTION OVERVIEW 


The GES employs the VL T-UT2 FLAMES high -resolution 
multiobject spectrograph ( iPasquini et al.|[2002 l. FLAMES 
has two instruments, the higher-resolution UVES, fed by 


8 fibres, and GIRAFFE, with about 130 fibres. In the 
present paper we concentrate on the much more numer¬ 
ous GIRAFFE spectra to thoroughly test our classification 
scheme; a similar analysis of the UVES spectra will follow 
later. 

We use only spectra obtained in the wavelength range 
with Ha line - HR15N setting (6470-6790 A, R=17 000), 
for which the majority of targets selected by GES are open 
cluster stars. Along with ongoing GES observations, we also 
use all relevant ESO archive data, which are included to be 
analysed as part of GES. Individual spectra can therefore 
be dated up to nine years before the start of the survey. For 
objects with more than one spectrum from either new or 
archival data, the timespan of repeated observations ranges 
from two days up to one year, which enables us to consider 
their temporal variability. 

In this study, we use only spectra before sky subtraction 
and normalisation. Doing so, we avoid any wrong identifi¬ 
cation or inaccurate subtraction of nebular emission lines, 
which may show a significant spatial variability in the envi¬ 
ronment of young clusters. Out of altogether 28 957 spectra 
from the GES internal data release, we have 22 035 object 
spectra (i.e. 76% of the whole data) while the remaining 
6922 spectra are measurements of the background sky sig¬ 
nal. The distribution of targets on the sky is shown in Fig. 

m 


Next we describe the methods used to characterise spec¬ 
tral Ha emission. Here we give just a brief overview, and 
more details are given in Appendix [A| 

The entire analysis and the final results are based on 
spectra reduced to the heliocentric frame of reference. Nor¬ 
malisation was done by dividing the spectra with the me¬ 
dian value of the flux in a window of width 20 A around 
the Ha line. In the first step of the line profile analysis, we 
determinated the separation of absorption-dominated and 
emission-dominated profiles. This was done by analysing 


the steepness and strength of the left and right part of the 
line profile. Out of 22 035 spectra, we identified 9265 emis¬ 
sion profiles, taking all those with one or more Ha emission 
components into account. Some of these spectra represent 
repeated observations of the same star so the number of 
unique emission type objects is 4566 out of 12 392. 

Both simple and more complicated multicomponent 
profiles are present in our dataset (Fig. [^. The goal of 
further examination was to determine the number of sepa¬ 
rate emission or absorption components in each profile and 
whether a nebular component is present. Glearly a solution 
with more components should always yield a better fit. But 
blending of individual components often makes the solu¬ 
tion degenerate, while large intensity ratios make the claim 
of multiple component detection insignificant. To avoid de¬ 
generacy and to increase uniformity of treatment of a wide 
variety of line profiles, we decided to fit each profile by the 
combination of two independent Gaussian profiles either in 
emission or absorption, hereafter denoted as gi and g 2 , and 
an optional third one, denoted gn, to account for a nebular 
emission. We assume Ha as the source of all components, 
neglecting possible contribution from [He H] absorption line 
(6560.15 A) in hot stars. 

Each Gaussian profile has three parameters for which we 
use a standard notation (Ai - centre of the peak, <7^ - stan¬ 
dard deviation, Ai - height of the peak, i = 1, 2, n). Resort¬ 
ing to one Gaussian profile is not enough to describe diverse 
multicomponent profiles, while 3 or more are plagued by 
degeneracy. The two component fit on the other hand de¬ 
scribes well the emissions with self-absorption, P-Gygni or 
inverted P-Gygni profiles, or narrow emission peaks within 
a dominating absorption. The optional third (nebular) com¬ 
ponent is included in the fit if there is strong indication of 
the nebular emission. This indication comes from the pres¬ 
ence of [N H] emission lines in the object spectrum or from 
the presence of Ha emission line in the sky spectra from the 
region surrounding the object. In both cases, the nebular 
component is assumed to be in emission, where its wave¬ 
length and width are fixed by [N H] lines and Ha line in 
the sky spectra, as explained in Appendix |A.2| 
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Fig. 2: Examples of GES spectra with Ha in emission before normalisation and sky subtraction, plotted in heliocentric 
wavelengths. Dashed lines mark the adopted continuum level. 


Our analysis is based exclusively on automated fits. 
This should improve the uniformity of the results and avoid 
arbitrary decisions required in manual fitting. Automated 
multicomponent fits can be computationally intensive with 
results reflecting local minima of the solution, thus miss¬ 
ing the global one. To avoid these problems we adopt a 
sampling scheme for all the Gaussian parameters except 
intensities, which are fitted with a least-squares Leven- 
berg~Marquardt algorithm. We fix the A„ and values 
for gn based on the evaluation of Ha and [N II] emission, 
while we leave the peak height (A„) to vary in the fit. The 
Qn component is included in the fit only if it satisfies cri¬ 
teria, as detailed in Appendix |A.2| We sample the a pa¬ 
rameter of gi and g 2 (hereafter cti^ 2 ) logarithmically with 
15 values ranging from 0.15 to 2 A. The A parameter of 
gi and g 2 (hereafter Ai^ 2 ) Siie, in the first iteration of the 
fit, distributed linearly with 21 values on a flexible interval 
around the detected centre of Ha emission. Justification of 
the choice of these intervals is given in Appendix |A.2[ The 
fit is then iterated so that the final precision for the Ai ^2 
parameters reaches 0.1 A. At this point we assume that the 
Ai 2 values from the first iteration are close to the global 
minimum, so we use a better sampling to investigate the pa¬ 
rameter space only around these values. The peak heights 
^ 1,2 of gi and g 2 are allowed to vary continuously, as for 
9n- 

The raw results of this fitting procedure are 9 or 6 Gaus¬ 
sian parameters (Ai, 2 ,nj o’i, 2 .nj ^i, 2 ,n) for 3 or 2 fitted Gaus¬ 
sian components respectively along with the value of the 
goodness of fit defined as 

xl=(^[0-C]^/iN-N,,r) ( 1 ) 

where N is the number of data points and War is the 
number of input parameters to the least-squares method. 
We prefer not to use the expected a (dispersion) of the 
noise based on the S/N (signal-to-noise ratio) of the spectra 
since it might be influenced by systematic errors (e.g. 
uncertain placement of continuum, imperfect flat fielding) 
and statistical (Poisson) noise. Because it is physically 
plausible that the central wavelength of emission and 
absorption profiles are the same, or that an emission profile 
has a narrow core and broad wings, which cannot be fitted 
with a single Gaussian, we also perform this fitting proce¬ 
dure by setting Ai = A 2 . By doing this, we anticipate to get 
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better solutions of the fit in the case where two components 
in line profiles are well aligned and the original procedure 
misses the right initial values of A to have the wavelengths 
match exactly. This version of fitting produces better fits 
in terms of Xr for 402 spectra. As these solutions are also 
meaningful in the physical sense we adopt these new values. 


3. RESULTS 

The results from the fitting procedure for 8846 out of 9265 
spectra are contained in the catalogue described in Ap¬ 
pendix]^ We disregard 396 spectra due to the strong neg¬ 
ative values of flux around Ha in the spectra before nor¬ 
malisation (see Appendix [A| . We also disregard 23 (i.e. less 
than 0.3% of all) spectra with absorption profiles, which 
were misidentifled as emissions in the first step of the data 
reduction (e.g. due to noise or cosmic rays). In the pre¬ 
sentation of results, the velocity scale is often used for the 
Gaussian parameter A (thus denoted by v), with zero at the 
Ha centre. 


3.1. Stars with repeated observations 

The fitted sample of 8846 spectra represents data for 4459 
unique objects. We therefore have 3029 objects in the cata¬ 
logue represented with one spectrum and 1430 objects with 
repeated observations. The latter can be used to illustrate 
the temporal variability of Ha line profiles. They are also 
used for determination of the free parameters in our mor¬ 
phological classification scheme. The number of spectra for 
each of the 1430 objects varies from 2 t o 21 (Table [^. T he 


GES introduced an observing strategy (Bragaglia et al. 


prep. 


, which enables such existence of multiple spectra per 
star. Many of these spectra are also from the ESO archive 
data. 


3.2. Morphological classification 

Multiple Gaussian profile fit described in the previous sec¬ 
tion is a morphological one, so it is not necessarily related 
to the underlying physics of observed emission object. We 
deliberately choose a simple scheme using only up to two 
Gaussian profiles to describe the Ha line shape intrinsic to 
the object. Hence we can use a limited number of param¬ 
eters to classify all the 4459 analysed objects with their 
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Table 1: Frequency of multiple observations. The majority 
of objects (3029) have been observed once. Among the 1430 
sonrces observed several times, 120 have over ten observa¬ 
tions. 


N spec. 

1 

2 

3 

4 

5 

6 

7 

8 

N obj. 

3029 

784 

295 

149 

55 

17 

9 

1 

N spec. 

9 11 

12 

15 

16 

18 

19 

20 

21 

N obj. 

1 2 

2 

1 

4 

2 

14 

90 

4 


Table 3: Fractions of categorisations of the same object. 


prevalent Categories of spectra of the same object 


category 

Ew 

Esp 

Erfp 

Pcya 

^'Pcyg 

Sabs 

Eabs 

Ebi 

80.2 

0.8 

3.0 

0.4 

0.5 

13.5 

1.5 

Esp 

1.4 

80.0 

0.1 

0.5 

4.0 

3.6 

10.4 

Efip 

4.0 

0.1 

73.6 

1.7 

1.3 

18.6 

0.6 

^Cyg 

1.9 

9.7 

1.0 

64.0 

3.5 

3.3 

16.7 

IPcyg 

1.4 

5.2 

0.1 

0.2 

77.2 

5.3 

10.5 

Sabs 

12.1 

2.5 

13.2 

1.5 

1.8 

67.1 

1.9 

Eabs 

0.3 

4.3 

0.1 

1.7 

5.3 

1.6 

86.7 


8846 spectra by meaningful morphological categories and 
possibly also by physical ones. 

Classification is enabled by identification of the fitted 
components described in Appendix]^ which produces nine 
distinct flags for all emission-type objects. Employing this 
identification, we now focns on the more interesting profiles 
with two intrinsic components flagged EE(N), EA(N), and 
AE(N). Letters E, A, and N denote intrinsic emission, in¬ 
trinsic absorption, and nebnlar emission, respectively. Flag 
EA represents a profile where emission is relatively stronger 
than absorption, and the opposite is trne for AE. For more 
information on flags see Appendix [B| 

We establish seven classification categories to infer some 
correspondence to underlying physics for these spectra (ob¬ 
jects). The proposed seven categories with the additional 
two categories for nebnlar and single-component intrinsic 
emission profiles are listed in Table together with condi¬ 
tions for their assignment. This classification scheme nses 
three parameters {Kee^ Kcyg, Kgp), which are described 
later in this section, and their valnes are determined with 
the help of repeated observations. The latter is done nsing 
only 1191 objects for which we have at least two spectra 
flagged as EE(N), EA(N), or AE(N). 

In Table we sort 4443 spectra of 1191 objects in rows 
and columns according to the scheme in Table Each ob¬ 
ject contributes the number of its spectra (categories) into 
the row that corresponds to the most frequent (prevalent) 
category for all spectra of that object. If several (N) cat¬ 
egories are assigned the same (highest) number of spectra 
for a given object, then that object contribntes to the corre¬ 
sponding N rows, and the number of spectra with a certain 
category is divided by N before being added to the columns. 
Finally, values in each row are converted to fractions of the 
row’s snm. Diagonal values indicate the stability of clas¬ 
sification categories with time. We note that the table is 
not diagonally symmetrical, which would be the case if all 
objects had two spectra only. Here however, the category 
that is in minority for a certain object contributes to the 
off-diagonal elements while the prevalent category does not, 
except where prevalence is shared by more than one cate¬ 
gory. 

The parameters Kee = 0.9, Kcyg = 0.9 and K^p = 50 
km s“^ are selected so that they maximise the overall sta¬ 
bility of the classification categories. It is physically and 
morphologically sensible that we relate the degree of sepa¬ 
ration of two components to a function of the components’ 
widths (\/oY^H 7^), whereby we add an arbitrary constant, 
which shonld be close to 1. This constant is then determined 
by the stability of categories with time, which is indicated 
by the diagonal elements in Table Our conditions from 
Tablej^and the parameters KEE,Kcyg, and Kgp are more 


snpportive of the classification scheme if the diagonal ele¬ 
ments are high for all categories. Conversely, if we get many 
significant off-diagonal elements, this might point to the ac¬ 
tual morphological change of profiles for certain objects. We 
especially expect the latter from 108 objects with at least 
10 spectra (Table [^, which might be targeted through their 
variable nature. 

Following the scheme in Table we assign a classifica¬ 
tion category to all 8846 spectra. The last two colnmns in 
Table give an overview of the number of objects that be¬ 
long to each category. The fractions in the last colnmn are 
merely indicating the properties of onr sample and are not 
intended to demonstrate any general population character¬ 
istics. 

We observe two subtypes in the E flagged (single¬ 
component emission) spectra. The first one consists of very 
narrow profiles where the mean value of the a parameter of 
fitted Gaussian profiles is at ^ 0.29 A. This conld indicate 
the threshold for the width of the profile above which we 
are able to discern distinct components, and can serve as a 
confirmation of the high quality/resolution of GES spectra. 
The second subtype corresponds to Gaussian profiles with 
much larger widths (up to 2 A), which are also relatively 
strong, so that additional components are discarded by the 
rnleset from Fig. [Tsl 


3.2.1. Description of morphological categories 

We describe the ensemble properties of spectra for each of 
the categories nsing Figs. i-m All flgnres are composed of 
the same number and type of panels, although the reader 
must be cautious when comparing different categories be- 
canse of the different ranges on the horizontal axes. The dif¬ 
ferent types of histograms illustrate the qualitative (shape) 
and quantitative differences between the classification cat¬ 
egories based on the parameters of gi and g 2 (denoted also 
with indexes b,r,A,E for bluer, redder, absorption, and 
emission component, respectively). 

Gategory Eh; represents profiles with blended gi and g 2 ■ 
We observe the slowly falling wing of the profile on either 
side. Widths of both components can be very different. The 
redder component is normally centred on nebular velocity, 
which might also be a consequence of the nebular emission 
and the presence or lack of the correspondingly fitted 
component. The bluer component seems to be weaker and 
narrower on average and its shift from the systemic (neb¬ 
ular) velocity might be due to intrinsic properties (e.g. an 
approaching wind). 

Gategory Egp describes profiles with peaks at separa¬ 
tions between 20 and 40 km s“^. The redder and sharper 
component is normally centred on nebnlar velocity, which 
conld be dne to fitting of the nebnlar component with both 


Article number, page 5 of 


16 







A&iA proofs: manuscript no. articlel_submitted_arxiv 


Table 2: Classification categories and corresponding conditions. 


Category 

Flags 

Condition 1 

Condition 2 

N spectra 

Fraction 

Nebular emission 

N 



1148 

I3^ 

Single-component emission 

E(N) 



990 

11.2 % 

Emission blend (Eti) 

EE(N) 

|Ai — A 2 I ^ KpEiiTi + 


1729 

19.6 % 

Sharp emission peaks (Esp) 

EE(N) 

|Ai — A 2 I > Kee{o\ + 

Ul - U2 < Ks-p 

765 

8.6 % 

Double emission (Edp) 

EE(N) 

|Ai — A 2 I > Kee{o\ + 

Ul - 172 ^ Ksp 

545 

6.2 % 

P-Cygni (Pcpg) 

EA(N), AE(N) 

^Em ~ Aa6s > ^Cygip'Em ^Abs) ^ 


154 

1.8 % 

Inverted P-Cygni (IPcgg) 

EA(N), AE(N) 

Aa&s ^Em ^ ^Cygip^Em ^ Abs) ^ 


610 

6.9 % 

Self-absorption (Sots) 

EA(N), AE(N) 

|Aa6s ~ ^Em\ ^ ^Cygip'Em ^Abs) ^ 

Cf Abs Em 

1253 

14.2 % 

Emission in absorption (Eats) 

EA(N), AE(N) 

|Aa&s ~ ^Em\ ^ ^Cyg{(^Em ^Abs) ^ 

Ahs ^ ^ Em 

1652 

18.7 % 


8846 lOO^ 


Qn and gr, in which case only the bluer component is in¬ 
trinsic. 

Profiles with larger peak separations (between 100 and 
250 km s“^) are characterised by E^p category. Widths are 
between 25 and 100 km s“^ so the peaks are well separated. 
Peaks can be very different, but distributions of width (o'b,r) 
and flux ratios are symmetric. 

Category Pcyg accounts for profiles with absorption and 
emission components that are similar to P-Cygni profiles. 
The redder emission peak is roughly at systemic (nebular) 
velocity. Absorption component is usually weaker, which is 
a common feature for this type of profiles. 

Spectra with profile shape features that are generally 
inverse to those of P-Cygni are represented by IPcyg cate¬ 
gory. The absorption component is wider, but velocity sep¬ 
aration of components is mostly small. Flux ratio of emis¬ 
sion and absorption ha s a large span. As one of the ph ysi- 
cal interpretations see Kurosawa & Romanova (20131 for 
demonstration of the quasi-stationary appearance of the 
red-shifted absorption in T Tauri stars due to the unsta¬ 
ble regime of the accretion. 

Category Sa&s describes profiles with self-absorption. 
Emission is roughly centred on the nebular velocity and 
absorption can be blue or red shifted. 

The last category E^bs identifies profiles with emission 
in absorption, which may be normal stellar (photospheric) 
Ha absorption. Narrow emission is moderately blue-shifted 
and the flux ratio has a large span where either component 
may be dominant. 


3.3. Temporal variability from repeated observations 

The temporal change of the profiles can be evaluated first in 
terms of classification categories from Table We identify 
660 out of 4459 objects that exhibit no change in classifica¬ 
tion category through time (Fig. 101, and can be labelled 
as objects with stable profile shape. 

Furthermore, we can measure the variability of Ha pro¬ 
files of a certain object by comparing their flux F (Equation 

g ). By obtaining the standard deviation of the flux (ctf) 
dded by its mean (pp) for all spectra of a certain ob¬ 
ject, we can get an unbiased estimate of the flux variability 
whose distribution is shown in shown in Fig. [m We note 
that no timespan between the first and last observations is 
longer than 1 year and we observe no evident correlation 
between this timespan and the flux variation. 

The variability index displayed in Fig. El is a measure 
of the profile change, where a high value of ap/pF almost 
certainly indicates a strong variability while a small value of 
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Emission blend (E^) 


CIMAME: 06402373+0955239 
DATEOBS: 2012-10-27706:45:32.156 




25 50 75 100 

v^-v^ [km 




Fig. 3: Category Eyi- Panel (a) represents the typical 
continuum-normalised Ha profile for objects in this cate¬ 
gory (a spectrum without detection of prominent nebular 
emission lines was chosen for clarity). We also plot each 
Gaussian fit {gi,g 2 - dashed, gn - dotted) along with their 
sum (thick black-white line). The horizontal axis is centered 
on Ha at 6562.8 A. The panels (b) to (f) show the distribu¬ 
tion of fitted parameters for all the sources of this category. 
Parameters for the nebular component are only shown in 
panel (c) as a reference point for zero wavelength and the 
number of spectra is therefore different for that histogram, 
since not all spectra have the additional N flag. Indices 
b,r,A,E denote blue (shaded histogram), red (thick-lined 
histogram), absorption (shaded histogram), and emission 
(thick-lined histogram) component, respectively. 


the same parameter does not exclude relevant variations of 
the profile shape, because it is only related to the integrated 
flux. Nevertheless, we can assess which objects with re¬ 
peated observations exhibit minimum temporal variability. 
This can be done by taking o bjec ts with 0 relative change 
of classification category (Fig. 101 and taking the variabil¬ 
ity of flux F from Fig. E] into account. A more complete 
variability analysis is out of scope of this work and will be 
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Sharp emission peaks (E^^) 

CNAME: 07561129-6050580 




Fig. 4: As Fig. |^but for category E^p. 


Double emission (E^^) 


CNAME: 06410098-10932444 
DATEOBS: 2012-12-07705:30:44.959 



Fig. 5: As Fig. but for category 'Edp- 


addressed in future papers dealing with particular physical 
classes of stars with Ha emission. 

3.4. Resemblance of categories 

According to our results, there are some cases of objects for 
which the classification of the Ha profile changes with time. 
This might be due to the significant change in the actual 
profile shape or to the discrete conditions in our morpholog¬ 
ical classification scheme. The correlation of one category 


P-Cygni {Pcyg) 

CNAME: 16265744-2449495 


DATEOBS: 2005-05-01709:25:07.382 









Fig. 6: As Fig.j^but for category Pcyg- 
Inverted P-Cygni (IPpyj) 

CNAME: 08072682-4712412 


DATEOBS: 2012-01-03706:00:44.876 



Fig. 7: As Fig. [^but for category IPcyg- 


with the other is indicated by the off-diagonal elements in 
Table Generally, there is one other category (column) 
per eadi category (row) that stands out and points to their 
correlation. 

When Ef,i is the prevalent category for an object, it 
is most often in combination with Sabs, which is best ex¬ 
plained by one of the two components being in transition 
between absorption and emission. Similarly, the category 
Esp can change to E^bs if a relatively weak absorption is 
constantly present and one of the emission peaks dimin¬ 
ishes. Alternatively, one of the emission peaks can be rela- 
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Self absorption (S^ 


CNAME:06404989+0936495 
DATEOB5: 2011-12-23704:52:26.609 
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Fig. 8: As Fig.|^but for category Sabs- 


Emission in absorption (E 


abs‘ 


CNAME:06401015+1007328 
DATEOBS: 2012-10-27707:49:49.697 




25 50 75 100 

v^-v^ [km ] 
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400 

300 

200 

100 

0 
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Fig. 9: As Fig. [^but for category Eats- 


tively weak, as indicated in Fig. 17 (panel (f)) and in Fig. 12 
(panel 5), which could result in Tire preferential fitting ofa 
wide, low-amplitude absorption in the best fit procedure. 

The largest off-diagonal element connects the prevalent 
category E^p with Sabs- The distinction between the two 
categories is largely influenced by the inclination of the 
slopes in the profile that are liable to change in the pres¬ 
ence of additional weaker components or they are harder to 
retrieve in the case of more noisy spectra. 

The categories Pcyg and IPcyg both exhibit noticeable 
connection to Eohs. This can be due to their morphologi¬ 


10 ^ 

10 ^ 

10 ^ 

10 ° 


0 1 


no change in classification category 


23456789 10 11 12 13 14 15 16 17 18 19 


N exposures 


Fig. 10: Objects with stable profile shape. Horizontal axis 
represents the number of observations (spectra) for each 
object and the number of objects is on the vertical axis. 
There are altogether 660 objects belonging to a unique clas¬ 
sification category. Among them, 456 objects were observed 
twice, 134 sources were three spectra, and the remaining 70 
sources have over 3 observations. 


120 < AT. 


last—first 


^^last—first ^ 



Fig. 11: Distribution of variability ap/^p (standard de¬ 
viation of F divided by its mean) for the 1430 objects 
with more than one observation. The histogram shows four 
groups of different timespan ranges A T in days. Objects 
with the longest timespan (white) show small variability, 
while some of those with the shortest timespan (black) can 
exhibit significant changes. Objects within the group of the 
third longest timespan (light grey) display the strongest 
variation of their Ha profile. 


cal similarity in cases where the fitted profile properties are 
close to the limiting conditions from Table where it is suf¬ 
ficient that one of the fitted components’ width or their sep¬ 
aration changes slightly and the category shifts (e.g. due to 
the intrinsic change in velocity or orientation of the stellar 
accretion/outflow). Another option is demonstrated with 
the object in Fig. [T2l where the change of Pcyg to E^bs 
category is due to intrinsically different profiles. The width 
of the absorption component in panels 1 and 4 is very dif¬ 
ferent from that in panels 3 and 5. We also notice how the 
bluer side of the emission component changes, due to the 
presence of another emission component, and how this in¬ 
fluences the morphologically different fit in panel 2. The 
object in Fig. has a pronounced and intrinsic variability 
of its Ha profiles. This is justly reflected also in our mor¬ 
phological classification scheme. Such intrinsic changes are 
however relatively rare, as indicated by the prevalence of 
diagonal elements in Table 
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Fig. 12: Typical spectrum of object with a pronounced in¬ 
trinsic variability of emission profiles for which the cate¬ 
gories Pcyg and Eabs are prevalent. Spectra are ordered 
by date of observation given in the title of each panel. The 
sum of all fitted components is represented by the thick 
black-white line. Continuum is at 1. 


4. DISCUSSION AND CONCLUSIONS 

Gaussian fitting is one of the possibilities for analysis of 
single and multicomponent Ha profiles. The methods that 
we discussed here were used to first detect and then classify 
spectral line profiles in a uniform and unbiased manner. In 
this paper we limit ourselves to the Ha line, although this 
method could be applied to other spectral lines as well. 
From the 22 035 GIRAFFE spectra in the GES internal 
data release, we found 9265 spectra displaying emission¬ 
like features. This amounts to 4566 out of 12 392 unique 
objects, which represents a 36.8% detection rate. The high 
percentage of detected emissions is expected since a large 
fraction of all objects observed by GES with the HR15N 
setup, which includes the Ha line, are in the field of young 
open clusters. 

The Gaussian fitting produces results for 8869 out of 
9265 analysed spectra, the difference being mainly due to 
the spectra with negative fluxes (see Appendix [A|) . Fitting 
is done with two independent Gaussian profiles and a third 
one to account for the nebular emission component, when¬ 
ever necessary. The parameters of these three Gaussian pro¬ 
files enable us to construct a simplified morphological clas¬ 
sification. First, we flag all spectra (Appendix]^ following 
the ruleset in Fig. [T^ and summarise the result in Table 
Less than 0.3% or 23 out of 9265 spectra are flagged as hav¬ 
ing only absorption components, which was subsequently 
confirmed by eye inspection and therefore proves the effi¬ 
ciency of our detection routine. These spectra are excluded 
from further consideration where we use the classification 
scheme described in Table to assign morphological cat¬ 
egories to all spectra given in the online catalogue, whose 
contents are described in Tables mill and[^ Thus we have 
8846 Ha emission spectra in the catalogue corresponding 
to 4459 unique objects. The list of ADS references includes 
records about 305 or 7% of these objects (Table , while 
for roughly 25% of them, the presence of Ha emission is 
already indicated by SIMBAD. 

We find that 2239 analysed objects (4049 spectra) have 
an indication of the presence of a nebular emission. This 
evaluation depends on the assumption that nebular [N H] 
emission lines correlate well with Ha emission line and that 
the nebular emission is more or less smooth over a certain 
field of view or across individual clusters, so that the width 
and radial velocity of nebular lines do not show abrupt spa¬ 
tial changes. Therefore, A„ and (T„ parameters were fixed 
during the Gaussian fitting procedure. Although the [N H] 
emission lines can be explained by the nebular contribution, 
we caution that they can also be present due to the out¬ 
flow activity of the star, so it is possible that in some cases, 
the fit of the nebular component corresponds to some other 
than the strictly nebular contribution to the line profile. 

Overall statistics of the fitted components to Ha pro¬ 
files is given in Fig. We note that the high values of 
A parameter mostly correspond to the very strong emis¬ 
sion lines while sometimes, the best fit was produced with 
two strong Gaussian components, one in emission and the 
other in absorption, that summed together to represent a 
relatively weaker component. The nebular (sky) lines are 
fitted with only one Gaussian profile to avoid degeneracy 
issues, although in some cases, we acknowledge the possi¬ 
ble existence of several prominent sources of sky emission 
(e.g. in the line of sight toward Trumpler 20 and NGG 4815 
open clusters), that can effectively broaden the fitted neb- 
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ular component (cr„ values up to limiting 0.5 A). However, 
in such cases, the nebular component is usually excl uded 
from the fit (lack of cr„ values above 0.4 A in Fig. 131 due 
to the conditions listed in Appendix |A.2[ 


Through an unbiased way we demonstrate that 3765 ob¬ 
jects (7698 spectra) exhibit intrinsic Ha emission, which is 
formed in the immediate vicinity of the observed object. 
This accounts for 30% of all 12392 distinct objects inves¬ 
tigated in the present study. The flags from Table and 
conditions in Table further enable us to present the dif¬ 
ferent morphologies of observed Ha profiles in the context 
of categories, which aim to identify and describe physically 
distinct groups of objects. The classification categories are 
presented in Figs. |3][^ where the parameters of fitted Gaus¬ 
sian profiles {gi, ^ and the typical profiles illustrate the 
contrast between them. 


The selection and definition of classification categories 
are based on previous experience with the different types of 
emission line profiles that are commonly observed in stellar 
spectra. Such profiles already have diverse physical inter¬ 
pretations and by assigning categories to our objects we 
merely aim to guide towards them. We note that the con¬ 
ditions in Table are discrete, which can cause arbitrary 
transitions between categories for spectra of individual ob¬ 
jects. This can also be tested by the reader with the use 
of the first five best fit solutions given in Table We 
also note that the order and the timespan of the consec¬ 
utive changes of categories through time is not accounted 
for in Table 1^ We therefore caution that sometimes, rapid 
changes of categories for individual objects can occur, even 
when one category is prevalent. We can additionally com¬ 
ment on the choice of one of the parameters from Table 
by examining how the classification behaves when we fix 
Kee = Kcyg = 0.9 and vary only Kgp- It can be shown 
that for our dataset, the first stability region or constant 
values of affected diagonal elements from Fig. arise just 
at Kgp « 50 km s“^. 

Repeated observations for 1430 objects enable us to de¬ 
rive some basic properties about their temporal variability 
as presented in Figs. [T0| and pT| Furthermore, 1191 objects 
with multiple spectra having two intrinsic components help 
determine the values of parameters Kee, Kcyg, and Kgp 
for our morphological classification scheme. Spectra in Fig. 
[T^ demonstrate the change of profiles with time and at the 
same time point to some of the characteristics of our clas¬ 
sification, which enables us to select a particular type of 
profile, caused by a specific underlying physics, and helps 
better understand the corresponding stellar family. 


A more detailed investigation of variability could 
extend to more emphasised study of line profile intensities, 
continuum variability, analysis of broad band photometry 
and SEDs for our sources, which can be obtained wit h the 
help of VO SA - Virtual Observatory SED Analyzer (Bayo 
et al.|[2008 l. Such subsequent analysis is beyond the scope 


of this work but might be pursued by other authors from 
the GES. Here, we present a more general approach with 
fewer fitted components to line profiles in order to obtain 
sensible Ha classification categories for each spectrum. It 
becomes harder to reach this objective in a more detailed 
and model dependent approach. The methods used for 
analysing Ha profiles and the results presented here could 
therefore form a valuable basis for future work. 
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A. DETECTION AND GAUSSIAN FITTING OF 
Ha EMISSION LINES 

When classifying Ha features we aim at separating line 
profiles exhibiting emission components from those with 
absorption-only profiles. We start with selecting 22 035 ob¬ 
ject spectra before background sky subtraction and contin¬ 
uum normalisation. This is important since sky subtraction 
can introduce spurious effects if the sky (nebular) Ha com¬ 
ponent is not correctly identified and treated for each in¬ 
dividual spectrum. Due to the large volume of our sample, 
the most practical way to attempt the detection of emission 
lines is by means of an automated procedure that we de¬ 
scribe below. Gaussian fitting and its details are described 
next. 


A.l. Detection 


The method employed for detection and parametrisation of 
Ha line profiles includes several steps and conditions. The 
wavelength values used throughout this work are heliocen¬ 
tric. Normalisation of spectra is done first by extracting 
only the wavelength ranging from 6549.8 to 6582.8 A so 
as to avoid the [N H] emission lines and then calculating 
the median of the outermost 10 A on each side. Division 
of spectra with this median value then yields a satisfactory 
normalisation, as demonstrated in Fig. 


The adopted central wavelength of Ha is Aq = 6562.83 
A. We often notice strong emission lines at ~ 6553 A and 
^ 6577 A in our sample of spectra and so we define the left 
(6555 A) and right (6575 A) boundary in order to avoid 
false detections when searching for Ha emission. We note 
that the mentioned emission lines may be of atmosp heric 
origin due to [OH] excitation (Osterbrock et al. |l9^ . The 
selected region of spectra between the initial boundaries 
defined above is in our experience large enough to include 
the whole Ha profile. Next, we set the first condition for 
exclusion of Ha absorption: 


1 — min{y) > 3{max{y) — 1) 


( 2 ) 


where the value 1 denotes the continuum level, and y is the 
flux of the selected normalised region. If this condition is 
satisfied, the spectrum is not considered any further, which 
means that we can only detect emissions if the amplitude of 
the highest peak above the continuum in the selected nor¬ 
malised region is at least one third of the amplitude of the 
strongest absorption. Note that we therefore refrain from 
detection of faint chromospheric emission activity, which 
sits on a strong absorption component. Treatment of such 
weak emission is beyond the scope of this paper because 
it would require a detailed m odelling of the stellar a tmo¬ 
sphere. We re f er the readers to Lanzafame et al. (20151 and 


Frasca et al. (2014| for a discussion ot this topic. If the 
above condition is not fulfilled, the procedure continues by 
automatically inspecting the values of y by going step by 
step (pixel by pixel) from the left initial boundary (6555 A) 
to the right (6575 A) and vice-versa. The main task here 
is to define the smallest region in which emission is located 
and to exclude false detections due to strong noise or cos¬ 
mic rays. We therefore need to accurately define the limits 
of the potential Ha emission as near to the observed profile 
as possible. It turns out it is better to set two conditions to 
achieve this goal, first for broader and second for narrower 


lines (narrow emission over wide absorption): 

y[fc-|-40]-y[A:] > d & y[fc] ^ 1 (3) 

y[k + 10] — y[A:] > 2.5d & 1 — y[fe] < {max{y) — l)/3 (4) 

where k is the running pixel index and d = {max{y) —1)/5. 
One pixel corresponds to 0.05 A. These two conditions are 
valid for left to right direction, and the ones for reverse 
are symmetrical. The second argument of both conditions 
restricts the value of flux to either above 1 (continuum) for 
wider emission or below 1 for relatively narrow emission 
over wider absorption. When the two conditions fail, the 
spectra are excluded from further consideration. 

Although they might seem arbitrary, the conditions (1^, 
and are set according to our experience for the 
best overall detection performance. 11 230 spectra out of 
22 035 are selected when making use of the condition (1^, 
while 1965 are subsequently excluded with conditions ® 
and (^, among which there are also some realistic emission 
profiles. We therefore note that the described procedure is 
not perfect and its performance also depends on the S/N. 
Besides excluding emission profiles, the above conditions 
are also liable to produce false emission detections in case 
of strong noise or cosmic rays in the spectra. Generally, such 
cases are few and an automatic procedure usually allows us 
an accurate and rapid detection of Ha emission for thou¬ 
sands of spectra. Out of 22 035 spectra, 9265 are identified 
as emission-type profiles. Many of these spectra are those of 
objects with multiple observations both in the GES inter¬ 
nal data release or ESO archive. Therefore there are 4566 
unique objects in our sample. 


A.2. Gaussian fitting 


We perform more thorough analysis of our selected sample 
of 4566 objects and their 9265 spectra by means of Gaussian 
fitting of the spectral region around Ha. This is also where 
we tackle with the problem of identification of the nebular 
component, which might contribute an important part of 
the emission flux if not being the only source of emission. As 
already mentioned, the majority of targets in our sample are 
stars in young clusters and star forming regions, which are 
often wrapped in a nebular environment, so it is essential 
we separate this external contribution of flux in Ha from 
the one intrinsic to a given object. 

All Gaussian fitting referred to in this paper is done with 
the least-squares Levenberg-Marquardt algorithm. We use 
a sum of two independent Gaussian components gi , g 2 and 
one optional gn to account for the nebular component. The 
nebular component of Ha is indicated by the presence of 
[N H] forbidden lines (6548.05 A, 6583.45 A) in the spectra 
as well as by the sky spectra in the vicinity of observed tar¬ 
gets. Whether we include gn into the fitting routine depends 
on the assessment of the two mentioned indicators. 

The fitting procedure start s by putting the continuum 
of normalised spectra (see Sect. A.l) to 0. We disregard the 
396 spectra with low S/N (~ 4) displaying negative fluxes. 

Next, we evaluate the indicators for the presence of the 
nebular Ha component. The preferred indicator is the pres¬ 
ence of [N H] emission around Ha. Both [N H] forbidden 
lines are fitted with Gaussian profiles (gNi, gN 2 ) with Gaus¬ 
sian parameters {XNi,crNi, ^Ni, Xn 2 , crN 2 , An 2 ) where Ajvi 
and X]s [2 are allowed to shift ±1 A and the fitting range is 
within ±2 A of the canonical centre of the [N H] lines. The 
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a Ml and aM 2 values fall in the range of 0.1 to 0.5 A and all 
three Gaussian parameters are then determined with the 
least-squares method. The three conditions that must be 
fulfilled to accept this first indicator are: (i) both Ami and 
Am 2 must be greater than 0.5 to compensate for noisy spec¬ 
tra, (ii) /pixel divided by Ami,m 2 must be < 1 . 0 , and (iii) 
||AAri-AAr2|-(6583.45 A -6548.05 A )| < 0.1 A. If these 3 
conditions are met, the wavelength parameter A„ for is 
fixed to A„ = Aati -I- (6562.83 — 6548.05) A and the (t„ pa¬ 
rameter is fixed to the average of a mi and crjv 2 multiplied 
by the ratio Hoct/NIIo. explained in the next paragraph. 
When these conditions are not satisfied, we continue with 
evaluation of emission in sky spectra. 

Out of 132 available fibres for Giraffe spectrograph, 
around 24% in the full sample of 28 957 spectra are used to 
record background sky spectra and these provide valuable 
information about the object’s surroundings. Ideally, there 
should be one sky (background) exposure per object, which 
would enable us to more accurately determine the nebular 
contribution to the flux from stars in young clusters. Here 
we take five spectra from nearest surrounding fibres clos¬ 
est to each of the observed objects. Maximum distance is 
set to 5° but usually all five fibres are located within 3' of 
the object fibre, which roughly translates to 1 pc at the 
distance of 1 kpc. These five sky spectra are averaged and 
all three emission lines (Ha and both [N H]s) are fitted in 
the same manner, as described in the previous paragraph. 
The ratio Hao-ZNIIg. = 1.3 is obtained from average values 
of aMi,o'N 2 and cr„ in all evaluated sky spectra. Similarly 
to the first indicator of nebular emission, the same three 
conditions are applied here together with an additional one 
stating that |A„ — 6562.83 A | < 0.1 A. 

When neither of the conditions from the previous two 
paragraphs are satisfied, is not included in the fitting 
procedure, hence we only use gi and g 2 . The source/lack of 
nebular indication is noted in Table The component 
has A„ and (j„ fixed and only A^ is fitted. For gi and g 2 
the Ai ^2 and ai _2 parameters vary in a certain interval with 
a fixed step whereas Ai ^2 is also fitted by the fitting proce¬ 
dure. The step size of cti 2 is the same for all spectra. We 
set the lower value to 0.15 A based on the resolution of the 
spectrograph (R ^ 17 000 ) and the upper value to 2 A that 
is sufficient for the broader components of the profiles. The 
use of 15 values that are logarithmically spaced allows us 
to better cover the regime of narrow lines to the detriment 
of broader ones. The Ai 2 parameter has 21 linear interme¬ 
diate values, which cover the range between the lower and 
upper boundaries, but the boundaries vary from spectrum 
to spectrum as described in the following paragraph. 

We aim to fit the smallest possible range including the 
whole Ha profile. This is partly done in the emission de¬ 
tection stage described at the beginning of this section, but 
here we want to define the region with absorption as well 
as emission parts of a generally multicomponent Ha pro¬ 
file. We therefore limit the fitting range first to 5 A (~ 230 
km s“^) at each side of Aq to avoid the already mentioned 
emission features and to ensure that the maximum initial 
grid step for Ai 2 parameters in the fitting procedure does 
not exceed 0.5 A. Next, the procedure proceeds from the 
middle to each side pixel by pixel (1 pixel is 0.05 A), eval¬ 
uating each consecutive chunk of 20 pixels ahead for the 
change in sign of the residual spectrum (normalised flux - 
1). When the change in sign occurs at least three times in 
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a given interval of 20 pixels, the procedure assumes that 
the continuum is reached and sets the furthermost pixel of 
the current chunk to be the limiting boundary. If this con¬ 
dition is not reached, the boundary (the limiting interval 
\xmin,Xmax]) stays at ±5 A from Aq. We use 21 linearly 
spaced values to split the interval between the lower and 
upper boundaries. From the intervals of values for Ai ,2 and 
(Ti 2 we get 49 770 combinations of gi and g 2 that can be 
summed together with according to criteria in the above 
paragraphs. 

The results of the fit give us the values of Ai^ 2 ,n) o'i. 2 ,ra 
and Ai^ 2 ,n of the best fitted gi + g 2 + gn, along with the 
reduced (x^ por pixel). The same procedure is then re¬ 
peated once more assuming Ai = A 2 , as already described 
in Sect.[^ and the final adopted set of parameters is the one 
achieving a better match between observed and modelled 
Ha profiles. 


A.3. Fit statistics 

The distribution of the values obtained for the parame¬ 
ters describing gi,g 2 , and g^ are shown in Fig. |13| The 
double-peaked profile in the histogram implies that ob¬ 
jects with indicated nebular component belong to distinct 
nebular environments or clusters with different radial veloc¬ 
ity. Histogram of cr„ shows a strong peak at 0.39 A, which 
corresponds to the instrument resolution, indicating a neb¬ 
ular emission with an intrinsic FWHM smaller than the 
instrumental profile. 

The x/. parameter might be misleading where one or 
more emission components are relatively strong, therefore 
we construct another measure of the goodness of fit - x^f: 
which is the previously defined normalised with average 
flux where the total flux of the line is the integral 

J I 5 i(A) + 52 (A)-I- 5 „(A)| dA (5) 


over the whole fitting range {[xmin,Xmax], see above) de¬ 
termined for each spectrum. The dist rib utions of Xr 

Due to the nor- 
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xif for all spectra are plotted in Fig. 
malisation of Xr for the strongest emissions, the tail on the 
right is reduced and the distribution shifted to lower values 
when going from Xr fo Xrf- bump on the left of the 
Xrf distribution is mainly due to the very strong nebular 
emissions where Xr i® small and F is relatively large. 


B. IDENTIFICATION OF FITTED 
COMPONENTS 

In order to implement our morphological classification of 
Ha line profiles, we use the parameters of fitted 51,521 and 
5 „ to determine their type, relative strengths and to what 
degree one blends with another. We define a set of rules, 
which enables us to assign one of the following flags to each 
spectrum: E, EE, EA, AE, N, EN, EEN, EAN, AEN. Let¬ 
ters E, A, and N denote intrinsic emission, intrinsic absorp¬ 
tion, and nebular emission, respectively. The number of let¬ 
ters in each flag matches the number of components, with 
the exception of the least interesting ones for our study (i.e. 
N, AN, and AAN) where the emission is solely due to the 
nebular component. These three flags are therefore joined 
into flag N. The order of letters E and A represents the 
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Fig. 13: Histograms of Gaussian parameters for the 4459 emission type objects. We show the values for components gi 
and g 2 (top panels), along with those for gn (bottom panels). When the nebular component is not included in the fit, 
their values are left out, except for An that are stacked in the first bin (value = 0). 



Fig. 14: Histograms of Xrf (solid line) and Xr (dashed line) 
representing the formal goodness of fit for 8846 spectra. The 
horizontal axis is logarithm of Xr where square 

root of Xrf represents relative 0-C. 


relative strength (absolute equivalent width) of the corre¬ 
sponding components where the first one is stronger (e.g. 
EA - absorption is weaker than emission, EAN - the same 
with additional nebular emission component). 

The rules are defined in the outline with conditional 
statements in Fig. [T^ where one of the branches deals with 
flags without nebular emission and the other with those 
including nebular emission (N). 

For this ruleset, we define fi (where i = l,2,u) as the 
integrated flux of Gaussian component gi, which is given 


Table 4: Number of emission type spectra for each of the 
nine flags. 

E EE EA AE N EN EEN EAN AEN 

“TiS 2TOI 1408 940 1148 642 938 838 483 


by fi = Ai(7i\/^. Next we define the following margins: 


Sf = OAF 

£a = 0.1 

£rr = 0.1 


( 6 ) 

( 7 ) 

( 8 ) 


which, according to our tests, enable the best performance 
when flagging individual spectra and then assigning distinct 
morphological categories in the second step of the morpho¬ 
logical classification scheme. The flux / is subject to signif¬ 
icant variations in absolute value and therefore £f depends 
on the integrated Gaussian solution F (Equation ([^). By 
doing this, we account for the very strong emissions where 
Gaussian components, which are relatively weak, but still 
significant in absolute value, must be weighted appropri¬ 
ately. 

The main purpose of the conditions in Fig.[^is to deter¬ 
mine whether different fitted Gaussian profiles gi,g 2 , and 
gn are similar enough in terms of their A and a parameters 
to represent the same feature (component) of the line profile 
or if they are so different to indicate two or more distinct 
components. This can only be evaluated under the limita¬ 
tions of the sampling scheme for the fitted parameters, the 
resolution of the spectra and their S/N. The defined set of 
rules and the margins £f,£\, and are therefore selected 
accordingly. The number of spectra that are assigned a cer¬ 
tain flag is listed in Table 

We exclude spectra with flag N from further discussion. 
Thus we have 990 single intrinsic emission spectra and 6709 
spectra with a combination of two intrinsic components, 
flagged EE(N), EA(N), and AE(N). 
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fn ^ 


E: /i > 0 

& /2 > 0 

& 

Ai - 

A 2 I < Ex 

& (Tl - 

0-2 

< e<T 

l/i| < £/ & /2 > 0 1 

1 l/2|<£/ V /i > 0 

EA: /i > 0 

& /2 < 0 

& 

I/ll 

> 1 / 2 ! II 

/i <0 

& 

/2 >0 

& l/2|>|/l| 


AE: /i > 0 

& /2 < 0 

& 

I/ll 

< 1 / 2 ! II 

/i <0 

& 

/2 >0 

& l/2|<|/l| 


EE: /i > 0 

& /2 > 0 

& 

I/ll 

^Ef & 

I/ 2 I ^ £/ 






fn ^ 

- N: 

|/i|<e/ & l/2|<£/ II 

|Al-An|<£A & |A2-A„|<eA & \(ri - an\ < Ea- & \a2 - a„\ < Eir & /l > 0 & /2 > 0 II 
I/ 2 I < £/ & |Ai-A„|<eA & |(Ti-cr„|<£a & /i > 0 II 

|/i|<e/ & |A2-A„|<eA & |o-2-o-„|<£a & /2 > 0 


— EN (AN if the last conditional term in each line does not hold): 


Ai 

- A„| 

< £a 

& 

ui - cr„ 

< £tr 

& 

fi>0 

& 

1A 2 

- A„| 

bj 

A\ 

& 

1/2 

is 


& 

/2 >0 

1A 2 

- A„| 

< £a 

& 

0-2 — cr„ 

^ E(j 

& 

/2 iso 

& 

Ai 

- A„| 

bJ 

A\ 

& 

1/1 

is 


& 

/i >0 

Ai 

- A„| 

bj 

A\ 

& 

I/ll ^ £/ 

& 

I/ 2 I 

< £/ 

& 

/i >0 

II 








1A 2 

- A„| 

bj 

A\ 

& 

I/ 2 I ^ £/ 

& 

I/ll 

< £/ 

& 

/2 >0 

II 








Ai 

- A„| 

< £a 

& 

ui — cr„ 

^ £<t 

& 

I/ 2 I 

< £/ 

& /i > 0 

II 







1A 2 

- A„| 

< £a 

& 

0-2 — cr„ 

^ £<t 

& 

I/ll 

< £/ 

& /2 > 0 

II 







Ai 

— A 2 I 

< £a 

& 

(Tl — (72 

Efj 

& 

/2 4 

-/i >0 










AAN: /i 

sS 0 

& 

/2 ^ 0 















AN: (/i 

^ -£/ 

& 

I/ 2 I < £ 

f) II 

if 

2 ^ - 


& 

I/ll 

< £/) 









— EAN, AEN, EEN: The same conditions apply as for EA, AE, EE 

Fig. 15: Set of rules that treats each fit following conditions from top to bottom and from left to right. The top part of 
the scheme (/„ ^ £/) only characterises the profiles with stellar component(s), while the second branch (/„ > e/) also 
includes the nebular component. In Table we merge the categories AN, AAN, and N because the emission is solely 
due to the nebular component. 


C. SUPPLEMENTARY INFORMATION FROM 
EXTERNAL SOURCES 

We extract a collection of data from several sources. As 
the coordinates of objects are the most reliable search pa¬ 
rameters, we use them to retrieve information from SIM- 
BAD, VizieR, and ADS on-line databases. Epoch 2000.0 
coordinates of our objects are not identical with those from 
the catalogues, so we adopt matching limits as described 
below. Tables and provide supplementary information 
from VizieR, SIMBAD, and ADS databases. 

C.l. Object types from SIMBAD 

We use a search radius of 1 arcsec. Altogether 2546 out of 
4459 objects have a SIMBAD match satisfying this crite¬ 
rion. Typical accuracy of the match is ^ 0.3 arcsec, im¬ 
plying that the matches we found are secure. We examine 
the general description of their object Wpe in the SIMBAD 
database (i.e. the “otype” flag). Table Im lists the most com¬ 
monly encountered “otype” flags for our targets. While the 
first two categories are general definitions, those with more 
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than 15 occurrences clearly indicate their nature. These are 
mostly young, active stars, probably in their early phases 
of evolution. Some of the objects have been detected as an 
X-ray or a radio source while, interestingly, there is only a 
handful of known interacting binary stars. Further exami¬ 
nation of objects from our catalogue is thus a promising way 
to discover several objects belonging to the latter category. 

C.2. References from the ADS database 

References from the literature should serve as possible ad¬ 
ditional information about objects of interest, they are not 
to be taken as reliable sources of the characteristics of a cer¬ 
tain object. 305 out of 4459 objects are matched success¬ 
fully with references from the ADS database. The search 
is done based on object coordinates and separately on the 
main SIMBAD identifier, the latter being justified by the 
fact that ADS and SIMBAD work together in retrieving the 
relevant literature. When searching by SIMBAD identifier, 
the restriction to objects with angular distance ^ 1 arcsec 
of matched records in SIMBAD is applied. The results are 
further filtered by the presence of keywords “Ha”, “emis- 
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Table 5: Distribution of objects according to their classifi¬ 
cation flag “type” of the SIMBAD database. N denotes the 
number of objects from our catalogue that fall into each 
category. 

N otype 

1025 star in cluster 

356 star 

244 young stellar object 

198 variable star of Orion type 
177 low-mass star (M < Mq) 

131 young stellar object candidate 

85 pre-main sequence star 

75 T Tau-type star 

73 pre-main sequence star candidate 

61 emission-line star 

47 infrared source 

20 x-ray source 

16 flare star 

5 variable star 

3 radio source 

2 brown dwarf (M < 0.08 Mq) 

2 brown dwarf candidate 

2 carbon star 

2 Cepheid variable star 

2 eclipsing binary of W UMa type (contact) 

2 Herbig-Haro object 

2 open (galactic) cluster 

2 Red Giant Branch star 

2 variable star of Mira Get type 

1 Be Star 

1 dark cloud (nebula) 

1 double or multiple star 

1 eclipsing binary 

1 HII (ionized) region 

1 millimetric radio source 

1 object of unknown nature 
1 rotationally variable star 

1 spectroscopic binary 

1 sub-millimetric source 

1 variable of BY Dra type 

1 white dwarf candidate 


ordered by Xr Table but for all relevant discussion 
in the main text, only the best result for each spectrum is 
used. 

Some of the objects observed in GES are known to be 
of peculiar type and are already discussed in the literature 
or listed in different catalogues. For all the 4459 analysed 
objects, we present results from the search in ADS, SIM¬ 
BAD, and VizieR databases in Tables and This search 
is detailed in Appendix [C We note that 305 out of 4459 
objects are mentioned in the list of ADS references (Table 
| 8 1 , and according to SIMBAD, roughly 25% of 4459 objects 
already have an indication of Ha emission. The electronic 
version of the catalogue will be made publicly available at 


the CDS. 


sion”, “objective prism survey”, “cataclysmic”, “symbiotic”, 
“binary”, “outburst”, and “pulsat” in the title of each paper. 

C.3. VizieR sources 

We perform queries by coordinates to retrieve the proper¬ 
ties of our sources contained in VizieR catalogues. Results 
are presented for each of the following wavelength ranges: 
Gamma-ray, X-ray, EUV, UV, Optical, IR, and Radio. For 
1853 objects in Table we list the number of VizieR tables 
in which a match is found within an angular distance of up 
to 1 arcsec. 


D. CONTENTS OF THE CATALOGUE 

The catalogue contains a master table with the main results 
for each analysed spectrum, and its extension for unique 
objects, which provides supplementary information to the 
reader. Their content is respectively detailed in Tables 
and[^ Each fitted spectrum is identified through the object 
name (i.e. the GNAME keyword build from the coordinates 
of the object) and the date of observation. We give the 
first five results of the fitting procedure for each spectrum 
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Table 6: Description of the content for the catalogue of 8846 spectra whose full table will be available at the CDS. 


Label 

Unit 

Description 

DATEOBS 

CNAME 


Date and time of the observation 

Sexagesimal, equatorial position-based source name in the form: hhmmssss+ddmmsss. Thus 
the CNAME of an object at 3h 40m 21.s767 and -31° 20' 32.71" is 03402177-3120327. 

RA 

0 

RA (J2000) 

DEC 

0 

DEC (J2000) 

Halpha lambda 1 

A 

five comma separated values - first five solutions of the fit ordered by Xr for the central 
wavelength of the first Gaussian profile 

Halpha sigma 1 

Halpha peak 1 

A 

As Halpha lambda 1 but for the a parameter of the first Gaussian profile 

As Halpha lambda 1 but for the height of the peak of the first Gaussian profile in units of 
normalised flux 

Halpha lambda 2 

A 

As Halpha lambda 1 but for the central wavelength of the second Gaussian profile 

Halpha sigma 2 

Halpha peak 2 

A 

As Halpha lambda 1 but for the a parameter of the second Gaussian profile 

As Halpha lambda 1 but for the height of the peak of the second Gaussian profile in units 
of normalised flux 

Halpha lambda neb 

A 

As Halpha lambda 1 but for the central wavelength of the nebular Gaussian profile 

Halpha sigma neb 
Halpha peak neb 

Halpha neb indication 

Halpha red chi 

Halpha red chi f 
Halpha flag 

Halpha category 

Halpha intrinsic 

A 

As Halpha lambda 1 but for the a parameter of the nebular Gaussian profile 

As Halpha lambda 1 but for the height of the peak of the nebular Gaussian profile in units 
of normalised flux 

0 - no indication of nebular emission, 1 - presence of [N II] emission in object spectrum, 2 - 
presence of Ha and [N H] emission in sky spectra (see Appendix A.21 
five comma separated values - Xr for first five solutions of the fit 
five comma separated values - Xr/ for first five solutions of the fit 

Flag of spectrum obtained from our ruleset in Fig. [fSl (see text) 

Gategory obtained from our classification scheme in 'iable|^ (see text) 

1 for intrinsic stellar emission, otherwise NULL (see text) 


Table 7: Description of the content for the supplementary information of 4559 sources whose full table will be available 
at the CDS. 


Label 

CNAME 

RA 

DEC 

Halpha_profile_change 

SIMBAD_main_id 

SIMBAD_angular_distance 

SIMBAD_identifiers 

SIMBAD_otype 

SIMBAD_stype 

ADS literature 


VizieR_n_Radio 

VizieR_n_IR 

VizieR_n_optical 

VizieR_^n_UV 

VizieR_n_EUV 

VizieR_n_Xray 

VizieR_n_Gammaray 


Unit Description 


as CNAME in Table 
as RA in Table l6 
as DEC in Table ^ 

0 for no temporaTchange in classification category of spectra from repeated observa¬ 
tions, otherwise NULL (see text) 

Main ID of the source in SIMBAD found for a given CNAME, otherwise NULL 
Angular distance of the object to the source SIMBAD_main id, otherwise NULL 
All SIMBAD identifiers for the source SIMBAD_main_id, otherwise NULL 
SIMBAD object type for the source SIMBAD_main_id, otherwise NULL 
SIMBAD spectral type for the source SIMBAD_main_id, otherwise NULL 
A comma-separated list of indices which correspond to the first column of Table 
where we list the ADS references ordered by the number of occurrences for all analysed 
objects, otherwise NULL 

Number of VizieR tables for Radio wavelength range in which object with CNAME 

has a match, otherwise NULL 

As VizieR_n_Radio but for IR wavelength range 

As VizieR_n_Radio but for optical wavelength range 

As VizieR_n_Radio but for UV wavelength range 

As VizieR_n_Radio but for EUV wavelength range 

As VizieR_n_Radio but for X-ray wavelength range 

As VizieR_n_Radio but for Gamma-ray wavelength range 


Table 8: Reference list (98 rows, full table will be available at the CDS). We only list the references with Noccur ^ 50, 
which denotes the number of objects from the catalogue that have corresponding article in the search results from ADS 
(see appendix C.2|. 


id 

T 

2 

3 

4 


157 

98 

98 

67 


5 66 

6 63 


title 

Ho Emission-Line Stars in Molecular Clouds. I. The NGC 2264 Region 
Catalogue of H-alpha emission stars in the Northern Milky Way 
Catalogue of stars in the Northern Milky Way having H-alpha in emission. 
Primordial Circumstellar Disks in Binary Systems: Evidence for Reduced Life¬ 
times 

Emission-Line Stars Associated with the Nebulous Cluster NGC 2264. 

New H-alpha-emission stars in Monoceros OBI and R1 associations 


reference 


i.1.1 (120041) 

Wehmeye 


Reipurth et al 


Kotioutek fe vVehmeWr 


Kohoutek & Wehm eyer 
(Jieza et al. 1 2UU9 1 


Herbigljly^ 


( 1999 1 

(TW7l 


Ugura|(|TMf 
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